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Abstract: The H2O Southern Galactic Plane Survey (HOPS) has observed 100 square degrees of 
f I ' the Galactic plane, using the Mopra radio telescope to search for emission from multiple spectral 

lines in the 12 mm band (19.5-27.5 GHz). Perhaps the most important of these spectral lines is the 
22.2 GHz water maser transition. We describe the methods used to identify water maser candidates 
and subsequent confirmation of the sources. Our methods involve a simple determination of likely 
candidates by searching peak emission maps, utilising the intrinsic nature of water maser emission 
- spatially unresolved and spectrally narrow-lined. We estimate completeness limits and compare 
our method with results from the Duchamp source finder. We find that the two methods perform 
similarly. We conclude that the similarity in performance is due to the intrinsic limitation of the 
i-C noise characteristics of the data. The advantages of our method are that it is slightly more efficient in 

Oh' eliminating spurious detections and is simple to implement. The disadvantage is that it is a manual 

method of finding sources and so is not practical on datasets much larger than HOPS, or for datasets 
with extended emission that needs to be characterised. We outline a two-stage method for the most 
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efficient means of finding masers, using Duchamp. 
Keywords: Masers - Stars: Formation - Techniques: Spectroscopic - Surveys 



> ■ 1 Introduct lOH. cores. However, to date most searches for these masers 

C^D , have targeted known regions of star formation and thus 

Astro physical masers hav e been known for oy er forty may suffer from biases. In order to reduce such biases 

years (|Weaver et al.l ll965l : [ Gunderma nnl ll965f ). They of candidate selection, it is important to conduct un- 

are known to occur in transitions of many chemical targeted surveys of the Galaxy for these masers. 

species, with the most common seen in hydroxyl (OH), The methanol mmt ibeam project l|Caswell et all 

methanol (CH3OH), silicon monoxide (SiO) and wa- [gpiph is one such survey, which focuses on the strongest 

ter (H 2 0). The 22.2 GHz H 2 maser transition is the and most widespread of the Class II CH3OH maserfl 

strongest and most widespread known to date. It is at 6 . 7G Hz. This survey is designed to cover the entire 

seen towards a variety of astrophysical obje cts such as Galactic p lane in a 4° wide latitude band. In the near 

low- and high-mass star forming regions ( eg. ICheung. Rank &,Tawned„,„ will be a survey on the Australian SKA 
V i |!969|;lForster fc Caswel|1999|;|Claussen et al|l996j) evolved Pathfinder radiot elescope (ASKAP), called GASKAP, 
stars (|Dickinson|l976|;|Miranda et al|200l|;|Hinkle fc Barnesj wmch will survey the sout hern Galaxy up to 10° away 
ll979l:lBarlow et al.ll996h and the centres of active galax- from the Galactic plane. One of the target spectral 
ies d Claussen et al. || l984 ). Other masers are often seen spe cies for GASKAP is OH, including the main maser 
arising from the same sites that give rise to H 2 masers. lines at L665 and i. 667 GHz. The H 2 maser line 
In particular, OH and CH3OH masers occur within the has recently been surveyed as part of the H 2 south- 
spatial resolution of observations (typically an arcsec- ern Galactic Plane Survey (HOPS), u sing the Mo- 
ond or l ess) of H 2 Q masers in high-mass star formin g pra radiot e lescope (| Walsh et al.l l201lT ). HOPS has 
regions (|Forster fc Caswelj |2000|; | Walsh et al.| |l99j). observed 100 square degrees of the southern Galac- 
This has led to the identification of a tentative evolu- tic plane at 12 mm and includes emission from the 
tionary sequence where both CH3OH and H 2 masers 2 2.2 GHz H 2 maser as one of its main target lines, 
appear early on during the formation process, with OH 

masers appearing at a more evolved stage, but with — 7— . 

, , , ■ n 1 1 Class I CH3OH masers arc collisionally pumped and 

some overlap between all three maser species. Such „ . , , ... , , , ; , v-n TT 

, , , , r arc usually associated with weak shocks whereas Class 11 

arguments are based on the statistical occurrence of C H 3 OH masers are radiatively pumped and closely associ- 

the masers towards other signposts of high-mass star ated with the early s tages of high mass star formation (eg. 

formation, such as ultracompact Hll regions and hot IVoronkov et al.|[2005l and references therein). 
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HOPS forms the basis for the work reported in this 
paper. 

With the advent of these untargeted, large scale 
surveys, the problem of finding and identifying new 
sources has become an important challenge. HOPS is 
the smallest of the surveys mentioned above, in terms 
of equivalent beam pointings in the survey area, with 
approximately 90 000 individual beams. However, the 
Mopra spectrometer delivers 4096 channels for each of 
16 simultaneous spectra yielding a dataset with nearly 
6xl0 9 independant voxels (l-b-v data elements). This 
number is prohibitively large such that searching each 
individual spectrum for maser emission by eye becomes 
impractical. In this paper, we describe the source find- 
ing method used in HOPS to identify H2O masers and 
compare its performance to the new source finding pro- 
gram, Duchamp. 

2 HOPS data 

Details of how the Mopra observations were under- 
taken a nd the resultant da ta have been described else- 
where (|Walsh et al.l I20T1T ). However, we summarise 
the relevant details here for convenience. Data were 
accumulated in square tiles that are 0.5° on each side. 
The tiles were oriented in Galactic coordinates and 
positioned either at b = +0.25° or b = — 0.25°. Each 
tile was observed twice, where one observation was 
scanned in Galactic longitude and the other in Galac- 
tic latitude. This greatly reduces (but does not com- 
pletely eliminate) scanning artifacts that appear as 
strips in the data cubes. Observing each tile twice 
also reduces the noise. The resulting data were re- 
gridded onto data cubes with 30" x 30" pixels. The 
Mopra beam at 22.2 GHz is approximately 2.2'. The 
final data cubes were typically strips 10° long and 1° 
wide. The spectral channel width in the data cubes is 
0.45 km s -1 . The main H2O maser data cube covers a 
velocity range of approximately —570 to +1290 km s _1 . 
Each H2O maser cube consists of 1203 x 125 x 2203 
pixels and is 1.3 GB in size. Since observations were 
taken under varying weather conditions and telescope 
elevations, the noise level varies by a factor close to 2. 

3 Source finding methods 

Here we outline two source finding methods that are 
generally available and their applicability to H2O maser 
finding in HOPS data. We also describe HOPSfind, 
which is the method we used to detect H2O masers in 
HOPS. 

3.1 Clumpfind 

Clumpfind l| Williams, de Geus fc BlitJl994h is an au- 
tomated source finding method that was designed to 
characterise spectral line emission from molecular clouds. 
Not only does it find regions of emission, but it also 
attempts to break up overlapping regions of emission 
into individual detections. For observations that show 
complex, extended emission, it is very useful in charac- 
terising detections by breaking up the emission along 



boun daries in an intuitive fashion (eg. IWalsh et al.l 
120071 1. However, the program is computationally inten- 
sive, with most of the computational power spent on 
identifying boundaries of extended emission and sepa- 
rating possibly overlapping detections. The H2O maser 
data from HOPS show very little overlap between in- 
dividual maser sites. Also, we do not expect any of the 
maser sites to be resolved in the Mopra beam. Thus, 
we regard clumpfind as too inefficient for our pur- 
poses. 

3.2 DUCHAMP 

DuCHAMfUl (Whitinel 1201 il l is a source finding pro- 
gram designed with large surve ys, particularly those 
associated with ASKAP, in mind (I Johnston et al.l[2007r ). 
It does not require emission to conform to a particu- 
lar shape (eg. Gaussian) or size, other than the user- 
defined inputs. It works by searching through data 
cubes for emission above a certain cutoff level and then 
merging detections, based on user-defined inputs. The 
user-defined inputs that are relevant to this work are 
the region within the cube to search, a threshold (ex- 
pressed in terms of a) over which voxels will be counted 
towards a detection, the minimum number of pixels to 
constitute a detection, the minimum number of chan- 
nels which constitute a detection and how far away 
voxels are allowed to be from each other in order to be 
considered in a merged detection. 

Duchamp also offers the option to reconstruct the 
data cube using the a trous procedure. This procedure 
allows the reliable detection of fainter sources and the 
removal of spurious detections in a noisy data cube by 
effectively removing some of the random noise. More 
details of this method can be found on the Duchamp 
website 1 . 

3.3 HOPSfind 

The limitation of any source finding method is the 
amount of time available to complete the task, so the 
choice of source finding method is one of efficiency: 
how to identify the largest number of real sources with- 
out identifying too many spurious sources in a reason- 
able time-frame. Du ring the HOPS pil ot observations 
and data processing (Wa lsh et al.l l2008l it was realised 
that each square degree of the Galactic plane is ex- 
pected to have no more than about 10 H2O masers, 
meaning that overlapping masers would be rare. Also, 
because the masers are typically spatially unresolved, 
bright (many Janskys) and narrow-lined (less than 5 kms -1 ), 
it is relatively easy to manually identify them, com- 
pared to weaker, extended emission. Therefore, we 
used the following method (which we call "HOPSfind" ) 
to identify H2O masers in HOPS: 

Each processed data cube was smoothed with a 
two-dimensional Gaussian kernel of size 90" x 90". 
From each data cube, a peak temperature map (PTM) 
was made. The PTM is made, using the miriad task 
moment and choosing the "mom=-2" option. This op- 
tion scans through the spectrum at each spatial pixel 

2 http: / /www. atnf.csiro.au/computing/software/duchamp 
Wcrsion 1.1.13 
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and finds the brightest emission, which is then copied 
to the PTM at the same position. Thus, the PTM is 
similar in appearance to the more commonly used "Oth 
moment", or integrated intensity map (IIM). However, 
the PTM has one significant advantage over an IIM. 
The HOPS IIM tends to be dominated by small er- 
rors in the baseline, if it is not completely flat. This 
is because we have a broad band (137.5 MHz across 
4096 channels), where the small baseline errors accu- 
mulate in the IIM. Until recently, radiot elescope ob- 
servations have been made with narrow bandwidths 
and with few channels, where small errors do not dom- 
inate in an IIM. It is with the advent of much broader 
bandwidths and many thousands of channels that the 
IIM becomes less reliable. We also note that attempts 
to remove the baseline are difficult because the base- 
line is usually complicated over the full extent of the 
band. A low-order polynomial may be removed from 
the baseline, improving the result, but small-scale de- 
viations still tend to be significant in an IIM. Because 
this problem is likely to affect many spectral line sur- 
veys in the future, we suggest that PTMs are used in 
favour of IIMs for locating emission. Figure [T] shows 
a comparison of an IIM (top) and PTM (bottom). In 
the IIM, it is possible to clearly identify one maser, 
but many more are seen in the PTM. 

We found that the PTMs are very useful in identi- 
fying most of the masers in the survey. However, the 
H2O maser spots appear as peaks that are typically 
broader than the spectral channel width (0.45 kms" 1 ), 
so our method can be improved by averaging adjacent 
channels together before producing the PTM. Since we 
found that the average maser line width is 1.4 km s -1 , 
we average 3x 0.45 kms -1 channels together. 

Maser candidates were then visually checked, us- 
ing the PTMs and by inspection of the spectrum and 
raw data cube at maser candidate positions. A maser 
candidate was deemed confirmed if the peak signal was 
greater than five times the noise level (5a) in at least 
two adjacent channels or if there were at least three 
adjacent channels that were 3a or greater in the raw 
(ie. unaveraged) data. We also checked for any sig- 
nals appearing in a single channel above 8a, but found 
no new masers that did not already satisfy the above 
requirements. Using this method, we identified 381 
masers (approximately 71% of all masers identified). 

H2O masers often occur with multiple peaks in the 
spectrum. Each of these peaks (called maser spots) 
are typically found spatially close to each other, on 
scales of a few arcseconds or less. The grouping of 
maser spots is referred to as a maser site. The spots 
within each site are expected to be associated with 
a single site of star formation. Multiple instances of 
maser sites are also often found within the same star 
forming cloud. Given the Mopra beam, it is not pos- 
sible to resolve a single maser site or multiple maser 
sites that are within the Mopra beam. Thus, if we see 
multiple peaks within a spectrum at a single location, 
we only assign it to a single detection, even though it 
is possible that many single detections are comprised 
of multiple maser sites. 

Any weak maser candidates that were not con- 
firmed in the initial list of 381 masers were reobserved 



with Mopra, using two minutes on-source plus two 
minutes off-source integration time, resulting in a noise 
level about y/2 lower than the initial observations. Any 
spectral features that were not common in both ob- 
servations were discarded as spurious. Unfortunately, 
because H2O masers are known to vary, it is possible 
that some real masers detected in the first observations 
were below the detection limit of the second observa- 
tions and thus discarded. 

In all, we identified 540 confirmed H2O masers, 
using the two-stage method outlined above. 

4 Source finding with DuCHAMP 

As described above, DUCHAMP is an automated source 
finding method that we can utilise to compare HOPS- 
find for detecting H2O masers. DUCHAMP uses a single 
detection threshold for the entire cube, rather than 
taking into account local noise levels, as is naturally 
done by the human eye. Therefore, in order to create 
the best data cube for DUCHAMP to search for sources, 
it is necessary to further process the data. We average 
a section of the H2O maser data cube that is free of 
line emission (500-lOOOkms" 1 ) and use the resultant 
2-dimensional image as an rms noise map. The full 
H2O maser cube is then divided by the noise map to 
effectively create a signal-to-noise cube, where the ma- 
jority of noise variations have been smoothed out. It 
is this signal-to-noise cube that we use duchamp to 
search for source. 

Successful operation of DUCHAMP requires fine tun- 
ing of the input parameters so that as many real sources 
are found, with minimal spurious detections. At some 
level, there is a trade off between how deep into the 
noise one chooses to search for weak masers and how 
much extra time is required to eliminate spurious de- 
tections. This applies equally to any source finding 
method. Our most efficient input parameters that we 
chose for DUCHAMP are given in Table [T] These input 
parameters effectively mean that there must be emis- 
sion over at least one beam in a single spectral channel 
and that a source must comprise of emission in at least 
two adjacent channels. This effectively eliminates the 
chance of finding masers that appear in only one spec- 
tral channel. However, our experience with HOPSfind 
found no such masers, so that these requirements be- 
come an effective method to remove spurious sources 
that may appear in a single channel. The requirement 
that there is emission in at least two adjacent channels 
is similar to the 3-channel averaging used in HOPSfind. 
We found that using only two adjacent channels and 
not three, as well as a cutoff level set at 3a, gives re- 
sults most closely matched to HOPSfind, allowing for 
a direct comparison. 

5 Comparison of HOPSfind with 

Duchamp 

An important consideration in the choice of any source 
finding method is how much time is required. We 
found that to search each square degree for H2O masers, 
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Table 1: Input parameters for duchamp. 



"Pat*?* mptpT 


Value 


T~)po ct\ "nti on 






Must have at least 15 


minPix 


15 


spatial pixels in one 
velocity channel 






Must include emission 


minChannels 


2 


in at least two chan- 
nels 






Must have at least 26 


min Voxels 


2G 


pixels in any channel 
in the cube 


snrCut 


3 


Pixels must be at least 
3x the rms noise level 


flag Adjacent 


true 


Channels must be ad- 
jacent to be merged 



HOPSfind typically took about 10 minutes so that the 
entire survey region can be searched in around 1000 
minutes. Most of this time is taken in transcribing 
the coordinates of detected masers to a source file 
and checking correctness of the transcribed details. 
The time taken for Duchamp to search each 10-degree 
strip was approximately 4.5 minutes so the entire 100 
square degrees of the survey region was searched in 
approximately 45 minutes. Thus, the running time 
for duchamp is significantly less than for HOPSfind. 
However, this time does not take into account the time 
required to set up each method, which is very short 
for HOPSfind but typically much longer for duchamp. 
This is due to time taken to alter input parameters 
in order to find the most efficient balance when run- 
ning duchamp. Thus, on small datasets, a manual 
source-finding method such as HOPSfind is quicker to 
implement and complete. On much larger datasets, 
HOPSfind will take much longer due to the time taken 
to catalogue each detection. 

Overall the duchamp source finding method found 
620 detections. Of the 620 detections identified by 
duchamp, 491 of them overlap with confirmed masers 
from HOPSfind. The majority of the remainder are 
considered spurious detections after visually checking 
the candidates, as was done for HOPSfind. A large 
fraction of the spurious detections using duchamp ap- 
pear to be artifacts along the scanning directions used 
in the observations. Inspection of the raw data cubes 
in these instances reveals linear features in either Galac- 
tic longitude or latitude and spurious detections are 
often made where two such scanning artifacts meet. 
Such features are easy for the human eye to identify, 
but difficult for an automated method. The scanning 
artifacts are generally of two types: a 'ripple' in the 
spectral dimension caused by a poorly subtracted base- 
line, or a region of elevated noise. 

Examples of spurious detections are shown in Fig- 
ures [5] and [3] Figure shows a peak in the spec- 
trum at around — 155.3 kms -1 , consisting of at least 
two adjacent channels. These two channels are shown 



as maps from the data cube in Figure [3] The po- 
sitions of confirmed H2O masers are shown in Fig- 
ure [3] with plus symbols and the location of detec- 
tions made by duchamp are outlined with black solid 
lines. As can be seen from the Figure, most of the 
duchamp detections coincide with confirmed masers. 
However, there are three duchamp detections which 
do not coincide with confirmed H2O masers at (l,b) 
= (25.42, -0.25), (25.43, -0.38) and (25.28, -0.25). 
The duchamp detection represented in Figure [5] is lo- 
cated at (25.43, —0.38). The two channels of the data 
cube do not show a spatially confined peak of emission 
at this position. Instead, both channels show posi- 
tive scanning artifacts that go through this position. 
The channel at —155.3 kms - shows a scanning arti- 
fact at a Galactic longitude of —0.38° and the channel 
at — 155.7kms _1 shows a weaker scanning artifact at a 
Galactic latitude of 25.42°. Thus, duchamp identifies 
the convergence of these two artifacts as a detection. 
Inspection of the data cube at the positions and ve- 
locities of the other two duchamp detections that do 
not coincide with a confirmed H2O maser indicate that 
they too are the result of scanning artifacts. 

It may be possible in the future to automatically 
eliminate some of these spurious detections by impos- 
ing a limit on the maximum size, or number of pixels 
that constitute a detection. The maximum size would 
be set at slightly larger than the beam FWHM. This 
is because we assume that H2O maser detections will 
always appear unresolved in our observations. Unfor- 
tunately, it is not possible to specify a maximum num- 
ber of pixels in duchamp. Even if this were possible, 
care must be used in removing such spurious detec- 
tions, since very strong masers will also be identified 
as spurious, as emission above the detection threshold 
will cover a larger area than the beam FWHM. This 
can be seen in Figure [3] where the duchamp detection 
associated with the brightest H2O maser, located at 
(25.83, -0.17), includes pixels that cover an area much 
larger than the beam FWHM. It is likely that a two- 
stage approach will be most useful: The first stage 
identifies strong masers above a high cutoff level, with 
no limitation on the maximum number of pixels in a 
detection. The second stage uses a low cutoff level 
to search for the weakest masers and also uses a maxi- 
mum number of pixels to eliminate spurious detections 
much larger than the beam size. 

Another approach is to remove scann i ng art ifacts 
prior to running duchamp. iPurcell et al ] (1201 ll ) have 
successfully demonstrated that most baseline ripples 
may be removed from HOPS data by fitting low-order 
polynomials to the line-free channels so that the per- 
formance of duchamp is greatly improved. Scanning 
artifacts which manifest as regions of elevated noise 
may be mitigated by dividing each spectral plane by 
a 'noise-map' to create a signal-to-noise cube, as de- 
scribed above. 

Of the 129 detections found by duchamp, but not 
confirmed as H2O masers, we find 11 of them that 
are possibly real detections, based on their size, shape 
and spectra, that were not identified using HOPSfind. 
These maser candidates await followup observations 
with Mopra to confirm them as real masers, or other- 
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wise. 

5.1 Comparison of completeness lim- 
its 

An alternate method of comparing our source finding 
method to duchamp is to test the efficiency of both 
methods in detecting weak sources that have been arti- 
ficially created in a source- free data cube. Note that we 
used this method in HO PS to determine th e complete- 
ness limit of HOPSfind ijWalsh et al.ll201lh and details 
can be found therein. We used the HOPS data within 
the 1=296 to 297 square degree as our template, in 
which we did not detect any confirmed masers, but has 
noise characteristics representative of the full survey. 
Using the miriad task imgen, we injected five sources, 
with typical H2O maser characteristics (ie. FWHM of 
1.4kms _1 ), at random positions and velocities into the 
data cube and searched for detections using HOPSfind 
and duchamp. This was repeated twenty times so that 
in total 100 sources were searched for. We varied the 
intensity of the generated sources to four values, equiv- 
alent to a peak of 5.5, 6.7, 8.4 and 11.1 Jy. The rms 
noise level in the data cube was 1.5 Jy. We also per- 
formed the source-finding on a signal-to-noise cube, as 
described in EJS] 

Figure [4] shows the success rate of detecting the 
generated sources in the data cube for HOPSfind and 
duchamp. We find that the methods have very similar 
success rates in detecting the generated sources, with 
HOPSfind detecting around 20% more sources than 
duchamp. We have used the same cutoff threshold for 
duchamp and in this one square degree data cube, it 
only detected one spurious source. It would be possi- 
ble to lower the cutoff and increase the detection rate 
to more closely match that of HOPSfind, but this in- 
troduces more spurious detections that must later be 
removed. We note that duchamp can also perform an 
d trous wavelet reconstruction, which determines the 
amount of structure present and then removes random 
noise from the cube. This can significantly improve the 
source detection statistics for duchamp. Duchamp, 
with the d trous wavelet reconstruction incorporated 
is shown in Figure [3] as the dotted line with crosses. 
For this implementation, we used three dimensions for 
reconstruction (ie. the full cube is reconstructed in 
one go) and a reconstruction threshold of 2.5. It can 
be seen that with the d trous wavelet reconstruction 
method, duchamp does indeed perform better, with 
results similar to HOPSfind. 

We note that H2O masers exhibit a range of line 
widths, whereas to estimate completeness limits, we 
have only injected sources with a single line width. 
However, we do not expect this will significantly affect 
our completeness comparison. This is because firstly 
85% of all masers have linewidths within a factor of 
two from the average 1.4kms _1 . Secondly, we ex- 
pect both HOPSfind and duchamp to perform sim- 
ilarly in detecting either unusually small-FWHM or 
large-FWHM masers. Thus, whilst the absolute com- 
pleteness limits may be affected by the linewidth of the 
maser emission, we do not expect there to be signif- 
icant differences between the relative performance of 



HOPSfind and duchamp. 

These results generally show that the final results 
from each source finding method are very similar. Our 
intention is not to show that one source finding method 
is better than any other, but we imply that the effi- 
ciency of any source finding method is naturally lim- 
ited by the noise in the data. Thus, well-tuned source 
finding methods will produce similar results and the 
choice of source finding method should be based on 
the ease of use, as well as other functionality, such as 
the ability to characterise extended sources. 

6 Data Visualisation 

Cross-matching HOPS datasets with complementary 
Galactic plane surveys is a key step towards achieving 
the science goals of the project. For example, matching 
NH3 clouds and H2O masers to featur es in the mid- 
infra red Midcourse Space Experiment l|Egan fc Pried 
1996) makes it possible to determine the physical prop- 
erties of star forming regions. To facilitate visualising 
the data we used the Chromoscop^\ software to directly 
compare large-scale HOPS images with other surveys. 
Chromoscope is an all-sky image-viewer which allows 
the user to easily move around the sky, fade between 
co-registered images and zoom-in to inspect features 
of interest. Because Chromoscope is javascript based 
it will run in any modern web-browser and is an ideal 
tool for publicising images online to the general pub- 
lic. However, it does not require an internet connection 
and has been used to present data at conferences, and 
to select interesting objects for follow-up studies. The 
software supports Keyhole Markup Language (KML) 
files, used to annotate positions of objects in the popu- 
lar Google s/o?0. Tools are also provided to convert as- 
tronomical catalogues to KML format for overplotting 
on the images. If an internet connection is available 
the Chromoscope can utilise the astronomical name re- 
solver 'lookUPlj to find and display specific astronomi- 
cal objects. The HOPS data is presented on Chromo- 
scope at |http://www.ast.leeds.ac.uk/hops/chromoscope| 

7 Conclusions 

We have compared two methods for finding H2O maser 
sources in the HOPS data. First, we used HOPSfind, 
which is a manual method of searching for signals in a 
peak temperature map, that is binned in such a way 
as to give the greatest contrast to typical H2O maser 
profiles. Second, we used duchamp, which is an auto- 
mated method of finding masers. 

Overall, we find the two methods of detecting sources 
to perform similarly well. In order to detect weak 
masers, it is always necessary to also accumulate spu- 
rious detections, which must later be assessed through 
followup observations. We find some minor limitations 
to the duchamp method because spurious detections 

4 http:/ /www. chromoscope. net 
https: // github . com /slowe /Chromoscope / 
5 http : / / www. google, com /sky 
6 http: / /www. strudel.org.uk/lookUP/ 
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that are the result of scanning artifacts are not flagged 
out. These limitations could be overcome in the future 
by rejecting candidate detections that do not conform 
to the beam size and shape or by running duchamp 
on a signal-to-noise cube which has had lower-order 
polynomials subtracted from the baselines. 

The duchamp method finds proportionately more 
spurious sources than HOPSfind, given the same de- 
tection limit. However, when including the a trous 
wavelet reconstruction method, we find similar detec- 
tion efficiencies. This is because the detection effi- 
ciency is not strongly dependant on the method used, 
but rather the noise characteristics of the data. Thus, 
the choice of source finding method should be based 
on ease of use and other features, rather than detec- 
tion efficiency. We conclude that HOPSfind is more 
appropriate to detecting H2O masers in HOPS, since 
they are few and far between, as well as being spa- 
tially unresolved, so that no extended emission needs 
to be characterised. For surveys of extended emis- 
sion, such as the ammonia inversion transitions de- 
tected in HOPS l|Purcell et al.ll201lh . duchamp is a 
better source finding method to use, since it automat- 
ically characterises the extended emission. It is ex- 
pected that in future surveys, such as those planned 
with ASKAP, many more sources will be detected, 
making a manual method like HOPSfind impractical. 
For example, GASKAP expects to find around 15 000 
OH masers. In addition to this, extended emission 
from thermal line and continuum emission needs to be 
characterised, making duchamp the more appropriate 
source finder. 
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Figure 1: Examples of moment maps used to iden- 
tify H2O mascrs in HOPS. (Top): Integrated in- 
tensity, or 0th moment map, where only one maser 
can be clearly seen at 1 = 25.83, b = —0.18. (Bot- 
tom): Peak temperature, or —2 moment map, 
showing 10 masers. 
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Figure 2: Spectrum of the spurious detection made 
by duchamp at the position (1, b) = (25.43, 
—0.38). The strongest peak is found at a veloc- 
ity of — 155. 3 km s -1 , with an adjacent channel at 
-^^kms" 1 . 
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Velocity: -155.37 km/5 




Figure 3: Two adjacent velocity channels are 
shown in the G025— 026 region. The plusses in- 
dicate positions of confirmed H2O masers and the 
regions outlined with black lines are the results 
of DUCHAMP. Within the held of view, there are 
three DUCHAMP detections that arc not associ- 
ated with confirmed H 2 masers and are most 
likely spurious detections centred on (l,b) = (25.42, 
-0.25), (25.43, -0.38) and (25.28, -0.25). 
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Figure 4: The number of successful detections of 
generated sources, out of a possible 100, is plot- 
ted against the peak flux density of the generated 
source, using HOPSfmd (solid line and open cir- 
cles), DUCHAMP (dashed line and filled triangles) 
and DUCHAMP with a trous wavelet reconstruction 
(dotted line and crosses). The vertical lines rep- 
resent the 5 and 7 times the rms noise level. All 
methods perform similarly, with the success of de- 
tection using DUCHAMP on its own slightly lower 
than the other two. 



